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Abstract. We consider the approximation problem of a pseudo-Boolean func- 
tion by a symmetric pseudo-Boolean function in the sense of weighted least 
squares. We give explicit expressions for the approximation and provide in- 
terpretations and properties of its L-statistic representation. We also discuss 
applications of these expressions in cooperative game theory and engineering 
reliability. 



1. Introduction 

Hammer and Holzman [1] investigated the problem of approximating a pseudo- 
Boolean function /: {0,1}™ — > K by another pseudo-Boolean function of smaller 
degree in the sense of standard least squares, where the degree of a pseudo-Boolean 
function is that of its unique multilinear polynomial representation. As an appli- 
cation, they showed that the Banzhaf power index, an important concept in coop- 
erative game theory, appears as the coefficients of the linear terms in the solution 
of the approximation problem by functions of degree at most one. 

Motivated by this work, in this paper we consider the problem of approximat- 
ing a pseudo-Boolean function by a symmetric one (i.e., invariant under permuta- 
tion of its variables). That is, we consider the closest symmetric pseudo-Boolean 
function to a given pseudo-Boolean function in the sense of least squares. To be 
general enough, we actually consider arbitrarily weighted least squares. We give 
explicit expressions for the approximations as symmetric multilinear polynomials 
and shifted L-statistic functions and we discuss a few properties of these approx- 
imations (Section 2). We then investigate the shifted L-statistic representation of 
the approximations and interpret their coefficients as a way to measure the influ- 
ence of the fcth largest variable (for arbitrary k $J n) on the given pseudo-Boolean 
function (Section 3). Finally, we discuss two applications of the approximation 
problem: defining a new concept of influence of players in cooperative game theory 
and interpreting system signatures used in engineering reliability (Section 4). 

We employ the following notation throughout the paper. We denote by B the 2- 
element set {0,1}. For any S C [n] = {1, . . . , n}, we denote by Is the characteristic 
vector of S in {0, 1}™ (with the particular case = l^). 

Recall that if the B- valued variables x%, . . . ,x n are rearranged in ascending order 
of magnitude $J • • • ^ xi n \, then X/^) is called the kth order statistic and the 
function os^ : B" — > B, defined as osfe(x) = X(u\, is the kth order statistic function. 
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We then have osfc(x) = 1, if Y2?=i Xi ^ n> — k + 1, and 0, otherwise. As a matter 
of convenience, we also formally define oso = and os n +i = 1. An L-statistic 
function is a linear combination of the functions osi,...,os n while a shifted L- 
statistic function is a linear combination of the functions osi, . . . , os„+i. 

2. Symmetric approximations 

In this section we present and solve the approximation problem of pseudo- 
Boolean functions by symmetric pseudo-Boolean functions and we discuss a few 
properties of the approximations. 

Through the usual identification of the elements of B™ with the subsets of [n] , 
an n-ary pseudo-Boolean function /: B" —> R can be equivalently described by a 
set function Vf : 2<- n ' — > R. The correspondence is given by Vf(S) = /(Is) and 

(1) /(x)= £ v f (S)Y[ Xl [] (!-^)- 

SC[n] ieS »e[n]\S 

Equation ([I]) shows that any n-ary pseudo-Boolean function / can always be 
represented by a multilinear polynomial of degree at most n (see [3] ) , which can be 
further simplified into 

(2) /(x)= ]T a f (S) Y[ Xll 

SC[n] i£S 

where the set function a/: 2^ — > R, called the Mobius transform of Vf, is defined 

by _ 

«/(5)= £(-l) |SHT| «/(T). 

res 

The following proposition gives two equivalent conditions for a pseudo-Boolean 
function to be symmetric, including a description as a shifted L-statistic function. 
A set function v: 2^ — > R is said to be cardinality-based if f(S') = v(T) for ev- 
ery S,T C [n] such that | -S' | — \T\. Equivalently, there exists a unique function 
v: {0, 1, . . . ,n} R such that = wdS 1 !). 

Proposition 1. J/ / is a pseudo- Boolean function then the following assertions are 
equivalent. 

(i) / is symmetric. 

(ii) is cardinality-based. 

(iii) f is a shifted L-statistic function. 

Proof, (iii) (i) O (ii) Trivial. 

(ii) (iii) Ii Vf is cardinality-based, then using ([1]) we obtain 

n n 

/( x ) = ^2v f (s) ^ II Xl II = X)"/( S )( a; (»- s + 1 ) _:C (n-*))' 
s=0 SC[n] i£S iE{n]\S s=0 
\S\=s 

which completes the proof. □ 

Denote by F(M n ) the set of n-ary pseudo-Boolean functions and by Fs(M n ) the 
subset of symmetric n-ary pseudo-Boolean functions. Since Fs(M n ) is precisely 
the set of L-statistic functions (see Proposition [T]), it is spanned by the linearly 
independent set B = {osi, . . . ,os„,os n +i} and thus is a linear subspace of F(M n ) 
of dimension n + 1 . 
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Given a weight function w: B" —> ]0, oo[ and a function / G F(M n ), we define 
the best symmetric approximation of f as the unique function Jl G -Fs(B n ) that 
minimizes the weighted squared distance 

ii/- »ir = e ^(x)(/(x)- 5 (x)) 2 

x£B" 

among all symmetric functions g G Fg(B n ). Note that || • || is the norm associated 
with the inner product (f,g) = £]xeB™ w ( x )/( x )5( x )- 

We can assume w.l.o.g. that the weights w(S) are multiplicatively normalized 
so that X^xeB" w ( x ) = 1- We then immediately see that the weights define a 
probability distribution over B™. 

Clearly, the solution of this approximation problem exists and is uniquely deter- 
mined by the orthogonal projection of / onto Fs(M n ). We then write Jl — A(f). 

The next theorem gives an explicit expression for A(f). For every / G F(B n ), 
define Vf. {0,1,..., n} — > R as Vf(s) = E(f{x) | |x| = s)Q In the special case 
when the weight function w is symmetric, we obtain 

= -pr E /W- 

w |x|=s 

Theorem 2. The best symmetric approximation of f G i f, (B n ) is given by 

n+l 

(3) A(/) = ^c, 0Sj -, 

where cj = Vf{n — j + 1) — tJ/(n — j) /or ewer?/ j G [n] and c n+ i = Vf(0) = /(0). 
Proof. The projection is characterized by the conditions 

(4) (/-A(/),osi)=0 (<G[n+l]), 
that is, 

(5) ^ «,(x)(/(x)-A(/)(x))=0 (i€[n+l]). 

We observe that conditions ([S]) remain equivalent if we replace the inequality |x| ^ 
n — i + 1 with the equality. Using ([3]) , we then obtain 

( 6 ) (E c ^( E = E "(*)/(*) (*e[» + i]) 

^ J=i ' ^ |x|=n— »+l |x|=n— i+1 

We finally obtain the result by subtracting equation i + 1 from equation i. □ 

The next corollary yields alternative expressions for A(f) as a shifted L-statistic 
function and symmetric multilinear polynomials. 

Corollary 3. The best symmetric approximation of f G F(M n ) is given by 

n 

(7) A(/)H/:l) + E c iK-K>l>), 

3 = 1 



Note that this notation is consistent with that introduced for cardinality-based set functions. 
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where Cj = Vf(n — j + 1) — t>/(ri — j) for every j G [n]. Also, 

(8) A(/)(x)= J2 Vf(\S\) U x i II (!-*<)= E ^(/)(I 5 D 11^' 

SC[n] «GS iG[n]\S SC[n] t£S 

wherea A{f) {s) = A s k v f (k)\ k=0 . 

Proof. Equation follows from ([3]) and © (for i = n+1). To establish ©, by fll} 
and ([2]) we only need to prove that VA(f)(s) = v/(s) and a A (f)(s) = A£ Uf(k)\ k= o. 
Let 5 C [n] such that l^l = s. Then, using ([3]) and ((6]), we obtain 

n+l 

v AU) ( S ) = A(/)(l s ) - £ c i = ^« I l x l = s ) 
and hence, since &A{f) is cardinality-based, 

a AU) {s) = aA ( /)(5) = E(-ir |T W)(i T i) = E(")(-ir^(/)W 

= A|u/(fc)| fe=0 , 

which completes the proof. □ 

We now examine the effect of a permutation of the variables of / on the sym- 
metric approximation A(f). Let S n denote the symmetric group on [n]. A permu- 
tation ir € S n acts on a pseudo-Boolean function / G F(M n ) by 7r(/)(xi, . . . , x„) = 
. . . ,£,r( n ))- A permutation n G S!„ is said to be a symmetry of / G _F(B") 
ifT(/) = /. 

Proposition 4. If n E S n is a symmetry of the weight function w, then for every 
/GF(B") we have A(n(f)) = A(f) and \\ir(f) - A(f)\\ = \\f - A(f)\\. 

Proof. If 7r is a symmetry of w, then clearly it is an isometry of F(M n ), that 
is, (7r(/),7r(<?)) = (f,g). Now, if g G F s (M n ), then by Q, we have <7r(/), ff ) = 
M/),7r(s)> = (/,<?) = <A(/), 5 ), which shows that A(tt(/)) = A(f). Using a 
similar argument, we obtain 

||7r(/)-A(/)|| 2 = (n(f)-A(f),n(f)-A(f)) = (f A(f), f A{f)) 

= \\f-Mf)\\ 2 , 

which completes the proof. □ 

With any pseudo-Boolean function / G F(B"), we can associate the symmetric 
function Sym(/) = ^ X^eS ""(/)• We then have the following result. 

Corollary 5. If the weight function w is symmetric, then for every f G F(M n ) we 
haveSjm(f) = A(Sym(f))=A(f). 

Proof. The first equality follows from the symmetry of Sym(/). The second one 
follows from Proposition 0] and the linearity of the projector A. □ 

Remark 1. Let F k (M n ) be the class of pseudo-Boolean functions of degree less than 
or equal to k and let A k be the orthogonal projector onto F k (R n ) (see [21 H])- If w 
is symmetric, then the best approximation of / 6 F(M n ) by an element of Fg(M n ) n 
F k (M n ) is given by A(A k (f)). In fact, by Corollary El the function A(A k (f)) lies 
in F s (B n ) n F k (M n ) and we have 

/ - A(A k (f)) = (/ - A k (f)) + (A k (f) - A(A k (f))), 
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where / — Ak(f) is orthogonal to F k (M n ) and Ak(f) — A(A k (f)) is orthogonal to 
Fs(M n ). Using the properties of Ak, we can also show that Ak(A(f)) — A(Ak(f)). 

We end this section by analyzing the effect of dualization of / on the symmetric 
approximation A(f). The dual of a function / G F(B n ) is the function f d G F(M n ) 
defined by f d (x) = 1 — /(lr n i — x). The following result is straightforward. 

Proposition 6. If the weight function w satisfies tu(lr n ] — x) = w(pc), then for 
every f G F(M n ) we have Vfd(s) = 1 — Vf (n — s). 

3. Influence of the km largest variable 

Following Hammer and Holzman's approach [3] , to measure the global influence 
of the fcth largest variable xr k ) 011 an arbitrary pseudo-Boolean function / G F(B"), 
it is natural to define an index /: F(M n ) x [n] — > R as I{f,k) — Ck, where Ck is 
defined in Theorem O 

Definition 7. Let I: F(M n ) x [n] ->• R be defined as /(/, k) = v f (n - k + 1) - 
v f (n-k). 

Thus we have defined an influence index from an elementary approximation 
(projection) problem. Conversely, the following result shows that A(f) is the unique 
function of Fs(B") that preserves the average value and the influence index of /. 

Proposition 8. A function g G i*s(B n ) is the best symmetric approximation of 
f G F(W l ) if and only if (f, 1) = (g, 1) and /(/, k) = I(g, k) for all k G [n]. 

Proof. The necessity is trivial (use Eq. ([1J for i = n + 1). To prove the sufficiency, 
observe that any g G i<s(B™) satisfying the assumptions of the proposition is of the 
form 

n n 

g = 5 (0) + J2 H9, ]) osj = g(0) + ^ I(f, j) os r 

3=1 3=1 

We then have g(0) + £™ =1 1(f,j) (o Sj , 1) = (g, 1) = (/, 1). Using ©, we finally 
obtain g = A(f). □ 

We now give a few properties of the index !(/, k). 

First, we see from Proposition|3]that I(7r(/), k) = I(f, k) whenever ir is a symme- 
try of w. Similarly, we see from Corollary [S] that /(/, k) ~ /(Sym(/),fc) whenever 
w is symmetric. 

We also see from Proposition [6] that, if w satisfies w(lr n i — x) = u>(x), then 
I(f d ,k) = I(f,n-k + l). 

We also have the following immediate property 

Proposition 9. For every fixed k G [n], the mapping f <—¥ I(f, k) is linear. 

It is a well-known fact of linear algebra that a linear map on a finite dimensional 
inner product space can be expressed as an inner product with a fixed vector. The 
next proposition gives the explicit form of such a vector for /(•, k). To this extent, 
for every k G [n] we introduce the function gk : B n — > R as <7fc(x) = Afe(dfeAfcOSfc_i), 
where d k = -1/ £|x|=n-fc+i 

Proposition 10. For every f G F(M n ) and every k G [n], we have I{f,k) = (f,gk)- 

Proof. We have d k+1 (f, A k os k ) = d k+1 Y,\ x \=n-k w ( x )/( x ) = ~^/( n ~ which 
leads immediately to the result. □ 
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Proposition [TU] shows that the index /(/, k) is the covariance of the random 
variables / and g k . Indeed, we have /(/, k) = E(f g k ) = cov(f,g k ) + E(f) E(g k ), 
where E(g k ) — (1, g k ) = 1(1, k) = 0. From the usual interpretation of the concept 
of covariance, we see that an element x £ B™ has a positive contribution on /(/, k) 
whenever the values of /(x) — E(f) and g k (x) — E(g k ) = g k (x) have the same sign. 
Note that g k (x) is positive whenever x^ is greater than the value (d k +iX( k+ i) + 
dfcX(fe_ 1 ))/(c?/ £ _|_i +dfe), which lies in the range of X(k) when the other order statistics 
are fixed at x. 



We now discuss two applications of the best symmetric approximation: defining 
a new concept of influence of players in cooperative game theory and interpreting 
system signatures used in engineering reliability. 

4.1. Influence of players in cooperative games. In cooperative game theory, 
the set [n] stands for the set of players, and its subsets are called coalitions. A game 
is a set function v: 2^ — > R with v(0) — 0. One thinks of v(S) as the worth of the 
coalition S. A game v is said to be symmetric if v is cardinality-based. 

In applications, it may be useful to approximate a given game v on [n] by a sim- 
pler one, for instance by a game whose corresponding pseudo-Boolean function has 
a degree strictly less than n. This problem was solved in [3J0], where interpretations 
of the so-called Banzhaf power and interaction indexes were given. Alternatively, 
it may be interesting to consider the best symmetric approximation of v, that is, 
the closest symmetric game to v. Since v can be seen as a pseudo-Boolean func- 
tion / E F(M n ) that assumes the value at 0, the closest symmetric game to v 
is precisely the game defined by A(f). We can then interpret the influence index 
/(/, k) = v(n — k + 1) — v(n — k) as the mean contribution of an arbitrary player 
to an arbitrary (n — fc)-coalition. 

4.2. System signatures in engineering reliability. Consider a system con- 
sisting of n components. When the components have i.i.d. lifetimes X\, . . . , X n , 
the signature of the system is defined as the n-tuple (si, . . . , s n ) £ [0,1]" with 
Si = Pr(T = Xi- n ), where T denotes the system lifetime. That is, Sj is the prob- 
ability that the ith component failure causes the system to fail. (For a recent 
reference, see [6].) It was proved [1] that 



where <fi: B" — > B is the structure function of the system. Thus we have Si = 
v<p(n — i + 1) — v<f,(n — i), assuming that the weight function w is symmetric. We 
then observe that Si is precisely the index I((f>,i), which can be interpreted as the 
influence on the system of the component that has the ith largest lifetime^ 

It follows that any expression involving the system signature can be reformulated 
in terms of the influence index for the structure function </>. For instance, as far as 
the system reliability is concerned, we have the following theorem (see O p. 27]). 



Note that this influence index is obtained from a standard least squares approximation prob- 
lem of the structure function. 



4. Applications 
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Theorem 11. For i.i.d. lifetimes X%, . . . , X n , we have F(t) = Y^7=i ^(^' *) ^i--n(t) 
for all t > 0, where F(t) = Pr(T > t) and F i:n (t) = Pr(X l: „ > t). 

5. Conclusion 

Wc have defined an influence index from the weighted least squares approxima- 
tion problem of pseudo-Boolean functions by symmetric ones, with an interpretation 
in game theory f £|4.1[) . In the non- weighted case, where the approximation reduces 
to symmetrization in the usual sense (Corollary [5]), we have shown that the influ- 
ence index coincides with the signature of a system made up of i.i.d. components 
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